Skip to content
Play overlay
Preview this course

Data Engineering with Talend

Self-paced videos, Lifetime access, Study material, Certification prep, Technical support, Course Completion Certificate


Uplatz

Summary

Price
£13 inc VAT
Study method
Online, On Demand What's this?
Duration
16.7 hours · Self-paced
Qualification
No formal qualification
Certificates
  • Certificate of completion - Free
  • Reed courses certificate of completion - Free

2 students purchased this course

Add to basket or enquire

Overview

Uplatz provides this comprehensive course on Talend, one of the leading cloud ETL/DW tools. It is a self-paced course with video lectures. You will be awarded Course Completion Certificate at the end of the course.

Talend is an open source software platform which offers data integration and data management solutions. Talend specializes in the big data integration. The tool provides features like a cloud, big data, enterprise application integration, data quality, and master data management. Talend is an ETL tool for Data Integration. Talend has a separate product for all these solutions. Talend is a code management tool for open source applications. It offers various data processing and data management software and services. Talend has a separate product for all these solutions.

Talend Product Portfolio

Talend Enterprise Big Data: a big data integration solution that extends Talend Open Studio for Big Data with teamwork and management features

Talend Platform for Big Data: a powerful and versatile big data integration and data quality solution that simplifies the loading, extraction and processing of large and diverse datasets so you can make more informed and timely decisions

Talend Open Studio for Data Integration: an open source application for data integration job design with a graphical development environment

Talend Enterprise Data Integration: extends Talend Open Studio for Data Integration with technical support and additional features

Talend Platform for Data Management: turn disparate, duplicate sources of data into trusted stores of consolidated information

Talend Platform for Data Services: a comprehensive unified data, application and service integration solution that lessens the impact of changing data structures while making the management of data across domains easier.

Talend Open Studio for MDM: a set of functions for master data management that provides functionality for integration, quality, governance, mastering and collaborating on enterprise data

Talend Platform for Master Data Management: turn disparate, inconsistent information across a business into a single, reliable “version of the truth”, providing increased confidence in decisions made

Talend Open Studio for Data Quality: an open source data profiling tool that examines the content, structure and quality of complex data structures

The Advantages of Talend

  • Talend open studio cuts data handling time into half thus reducing developer rates.
  • Talend open studio is highly efficient and reliable while working on large datasets. Moreover, functional error occurrence is much lesser when compared to manual ETL.
  • Talend has a large community of users that can be utilized by the developers to locate any error during the development of the ETL job.
  • It provides multiple open source integration tools free of cost to the users.

Main Features of the Talend

  • Repository - is the collection of technical components used in a job. This panel is also called the “Heart of Talend Open Studio”. In this, metadata of databases, table schemas and structure can be created and stored.
  • Design Workspace - Talend Studio’s next feature is the Design Workspace Window, here jobs can be designed and modeled with the help of a designer tab that shows the work graphically, and the code tab to detects possible errors and read the generated code.
  • Component Palette - The next important feature in Talend open studio is Palette, which is used to contain the various components required to build a job. The component palette is used as a preconfigured connector to perform the specific data integration operation and it can also reduce the amount of hand-coding needed to work on multiple data.

Talend Course Learning Outcomes

  1. Get a thorough knowledge of ETL ideas as well as the capacity to address real-time business challenges with Talend
  2. Know how to use the Talend tool to automate all of your data integration, data analysis, and data warehousing needs
  3. Interaction with various types of sources or target platform like Flat Files (CSV, Fixed width), XML, Excel and work with Databases
  4. Understand real-time scenarios for Data Transformation, File & Error Handling, Talend Job Scheduling, and Automation/Parameterization
  5. Analyze the working knowledge of Big Data and Hadoop principles, as well as the advantages of combining Talend with Hadoop
  6. Talend provides easy connection and access to the Hadoop Ecosystem for integration, data analysis, and data warehousing applications.
  7. Get a thorough engagement of a SME throughout the Talend Training to understand industry standards and best practises
  8. Implementation of Talend with HDFS, Pig, and Hive (the most sought and futuristic abilities)

Curriculum

1
section
21
lectures
16h 41m
total
    • 1: Talend Introduction Preview 15:07
    • 2: Architecture and Installation - part 1 1:44:21
    • 3: Architecture and Installation - part 2 47:43
    • 4: File - Java - Filter Components 53:39
    • 5: tAggregateRow - tReplicate - tRunJob Components 59:56
    • 6: Join Components 57:34
    • 7: Sort and Looping Components 53:41
    • 8: Looping Components 24:20
    • 9: Context 1:11:25
    • 10: Slowly Changing Dimensions (SCD) 44:55
    • 11: tMap Components - part 1 31:16
    • 12: tMap Components - part 2 1:11:25
    • 13: tMap Components - part 3 13:44
    • 14: Talend Error Handling 56:03
    • 15: Audit Control Jobs 47:56
    • 16: How to use tJAVA components with scenario 54:12
    • 17: Talend Big Data Hadoop Introduction and Installation 31:58
    • 18: Talend HIVE Components - part 1 47:34
    • 19: Talend HIVE Components - part 2 24:43
    • 20: Talend HDFS Components 58:02
    • 21: Talend TAC 30:32

Course media

Description

Talend - Course Curriculum

Role of Open Source ETL Technologies in Big Data

  • Overview on: TOS (Talend Open Studio) for Data Integration
  • ETL concepts
  • Data warehousing concepts

Talend - Introduction

  • Why Talend?
  • Features
  • Advantages
  • Talend Installation/System Requirements
  • GUI layout (designer)
  • Understanding it's Basic Features
  • Comparison with other market leader tools in ETL domain
  • Important areas in Talend Architecture: Project
  • Workspace
  • Job
  • Metadata
  • Propagation
  • Linking components

Talend: Read & Write various Types of Source/Target System

  • Data Source Connection
  • File as Source
  • Create meta data
  • Database as source
  • Create metadata
  • Using MySQL database (create tables, Insert, Update Data from Talend)
  • Read and write into excel files, into multiple tabs
  • View data
  • How to capture log and navigate around basic errors
  • Role of tLogrow and how it makes developers life easy

Talend: How to Transform Your Business: Basic

  • Using Advanced components like: tMap, tJoin, tFilter, tSortRow, tAggregateRow, tReplicate, tSplit, Lookup, tRowGenerator

Talend: How to Transform Your Business: Advanced 1

  • Trigger (types) and Row Types
  • Context Variables (parameterization)
  • Functions (basic to advanced functions to transform business rules such as string, date, mathematical etc.)
  • Accessing job level / component level information within the job

Talend: How to Transform Your Business: Advanced 2

  • Type Casting (convert data types among source-target platforms)
  • Looping components (like tLoop, tFor)
  • tFileList
  • tRunJob
  • How to schedule and run talend DI jobs externally (not in GUI)

Working with Hierarchical File Structures

  • Read and Write an XML file, configure the schema and XPath expression to parse an XML file
  • Read and Write a JSON file, configure the schema and JSONPath expression to parse a JSON file
  • Read and write delimited, fixed width files.

Context Variables and Global Variables

  • Create context/global variables
  • Use context/global variables in the configuration of Talend components
  • Load context variables from a flow

Best practices

  • Working with databases and implementing data warehousing concepts
  • Working with files (excel, delimited, JSON, XML etc.)

Orchestration and Controlling Execution Flow

  • Files - Use components to list, archive, and delete files from a directory
  • Database – Controlling Commit and Rollback
    • COMMIT at end of job/ every x number of rows
    • Rollback on error

Shared DB connection across jobs and subjobs

  • Use triggers to connect components and subJobs
  • Orchestrate several jobs in master jobs.
  • Handling Errors
    • Kill a Job on a component error
    • Implement a specific Job execution path on a component error
    • Configure the log level in the console statement of accounts

Who is this course for?

Everyone

Requirements

Passion to learn and succeed!

Career path

  • Talend Developer
  • Talend ETL Developer
  • Talend ETL Application
  • Smartstream Technologies
  • Delivery Manager
  • Senior ETL Developer
  • Consultant
  • Talent ETL Application
  • Talent Quality Engineer
  • Lead Consultant
  • Senior Engineer - Talend
  • Talend Big-data Developer
  • Talend Developer ETL Metholdology
  • Senior Consultant
  • Talend with cloud DB
  • Talend Integration Developer
  • Architect
  • Talend Big Data Application Designer

Questions and answers

Currently there are no Q&As for this course. Be the first to ask a question.

Certificates

Certificate of completion

Digital certificate - Included

Course Completion Certificate by Uplatz

Reed courses certificate of completion

Digital certificate - Included

Will be downloadable when all lectures have been completed

Reviews

Currently there are no reviews for this course. Be the first to leave a review.

FAQs

Study method describes the format in which the course will be delivered. At Reed Courses, courses are delivered in a number of ways, including online courses, where the course content can be accessed online remotely, and classroom courses, where courses are delivered in person at a classroom venue.

CPD stands for Continuing Professional Development. If you work in certain professions or for certain companies, your employer may require you to complete a number of CPD hours or points, per year. You can find a range of CPD courses on Reed Courses, many of which can be completed online.

A regulated qualification is delivered by a learning institution which is regulated by a government body. In England, the government body which regulates courses is Ofqual. Ofqual regulated qualifications sit on the Regulated Qualifications Framework (RQF), which can help students understand how different qualifications in different fields compare to each other. The framework also helps students to understand what qualifications they need to progress towards a higher learning goal, such as a university degree or equivalent higher education award.

An endorsed course is a skills based course which has been checked over and approved by an independent awarding body. Endorsed courses are not regulated so do not result in a qualification - however, the student can usually purchase a certificate showing the awarding body's logo if they wish. Certain awarding bodies - such as Quality Licence Scheme and TQUK - have developed endorsement schemes as a way to help students select the best skills based courses for them.